Exploring Compositional Data with the CoDa-Dendrogram

Authors

  • Vera Pawlowsky-Glahn
  • Juan Jose Egozcue
Abstract

Within the special geometry of the simplex, the sample space of compositional data, compositional orthonormal coordinates allow the application of any multivariate statistical approach. The search for meaningful coordinates has suggested balances (between two groups of parts), based on a sequential binary partition of a D-part composition, and a representation in the form of a CoDa-dendrogram. Projected samples are represented in a dendrogram-like graph showing: (a) the way of grouping parts; (b) the explanatory role of subcompositions generated in the partition process; (c) the decomposition of the variance; (d) the center and quantiles of each balance. The representation is useful for interpreting balances and for describing the sample in a single diagram, independently of the number of parts. Also, samples from two or more populations, as well as several samples from the same population, can be represented in the same graph, as long as the same parts have been recorded for all of them. The approach is illustrated with an example of food consumption in Europe.
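As a rough illustration of the balances mentioned in the abstract, the sketch below computes balance coordinates for one composition from a sequential binary partition. It uses the standard balance formula, sqrt(r*s/(r+s)) * ln(g(x+)/g(x-)), where g is the geometric mean and r and s are the sizes of the two groups. The 4-part partition `SBP`, the function names, and the sample values are hypothetical and are not taken from the paper; this is not the authors' implementation.

```python
import math

def balance(x, plus, minus):
    """Balance between two groups of parts of the composition x.

    plus and minus are lists of part indices; r and s are the group sizes.
    """
    r, s = len(plus), len(minus)
    # Log of the geometric mean of each group of parts.
    log_gm_plus = sum(math.log(x[i]) for i in plus) / r
    log_gm_minus = sum(math.log(x[i]) for i in minus) / s
    return math.sqrt(r * s / (r + s)) * (log_gm_plus - log_gm_minus)

def balances(x, sbp):
    """One balance per row of a sequential binary partition (SBP).

    Each row codes a part as +1 (first group), -1 (second group)
    or 0 (not involved at that step of the partition).
    """
    out = []
    for row in sbp:
        plus = [i for i, code in enumerate(row) if code == 1]
        minus = [i for i, code in enumerate(row) if code == -1]
        out.append(balance(x, plus, minus))
    return out

# Hypothetical SBP for a 4-part composition (illustrative only):
# step 1 splits {0, 1} from {2, 3}; steps 2 and 3 split each pair.
SBP = [
    [+1, +1, -1, -1],
    [+1, -1,  0,  0],
    [ 0,  0, +1, -1],
]

if __name__ == "__main__":
    sample = [0.1, 0.2, 0.3, 0.4]  # made-up composition
    print(balances(sample, SBP))
```

Because each balance depends only on ratios of parts, rescaling the composition (for example from proportions to percentages) leaves the coordinates unchanged, and a D-part composition yields D - 1 balances.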

Similar resources

Signal Interpretation in Hotelling’s T² Control Chart for Compositional Data

Nowadays, control of concentrations of elements is of crucial importance in industry. Concentrations are expressed in terms of proportions or percentages which means that they are compositional data (CoDa). CoDa are defined as vectors of positive elements that represent parts of a whole and usually add to a constant sum. Classical T² control chart is not appropriate for CoDa, for which is bett...

Spatial modelling of zonality elements based on compositional nature of geochemical data using geostatistical approach: a case study of Baghqloom area, Iran

Due to the existence of a constant sum of constraints, the geochemical data is presented as the compositional data that has a closed number system. A closed number system is a dataset that includes several variables. The summation value of variables is constant, being equal to one. By calculating the correlation coefficient of a closed number system and comparing it with an open number system, ...

Phytoplankton composition in shallow water ecosystems: influence of environmental gradients and nutrient availability

Environmental gradients caused by hydrological changes, whether natural or man-induced, affect the planktonic taxonomic and functional composition in shallow water ecosystems. In this sense, our aim was to find out the main variables or variable ratios that are the driving forces of the major phytoplankton taxonomic groups in Mediterranean coastal lagoons. For this purpose, 11 waterbodies were c...

Optimality Theoretic Account of Acquisition of Consonant Clusters of English Syllables by Persian EFL Learners*

This study accounts for the acquisition of the consonant clusters of English syllable structures both in onset and coda positions by Persian EFL learners. Persian syllable structure is "CV(CC)", composed of one consonant at the initial position and two optional consonants at the final position; whereas English syllable structure is "(CCC)V(CCCC)". Therefore, Persian EFL learners need to resolve...

Data on fatty acid profiles of green Spanish-style Gordal table olives studied by compositional analysis

This article contains processed data related to the research published in "Tentative application of compositional data analysis to fatty acid profiles of green Spanish-style Gordal table olives" (Garrido-Fernández et al., 2018) [1]. It provides information on the implementation of compositional data analysis (CoDa) to the fatty acid profiles of Spanish-style Gordal table olives vs the use of co...

Publication date: 2011